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S 1 

Research in quantum information science often leads to global optimization problems, and one of the main examples 

of this is the calculation of numerical measures of quantum correlations. One class of such measures is entanglement 

measures of pure multipartite quantum states. For instance, the calculation of the geometric entanglement measures 

l—> for pure states [l| and the calculation of the minimal measurement entropy [2|, which is considered in this paper, 

represents the global optimization problems. The second one is measures of mixed states. Most of such measures are 

based on minimization. Sometimes it's possible to use special optimization methods like convex optimization [3(, but 

y—{ ' in most cases we need to use general algorithms. For example, global minimization is needed to calculate the relative 

^- , entropy of entanglement [J| , geometric measures of entanglement |l| , any convex roof of measure for pure states (for 

C*~) ' example, the entanglement of formation [5|). 

The third type of quantum correlation measures is quantum discord, which was proposed independently by G] 

and [7|. Quantum discord is based not on entanglement, but on the existence of non-commutative quantum mea- 

' . | surements, so separable (not entangled) states can have non-zero discord. There is a supposition that not only 

\l ■ entanglement, but also quantum discord can be the resource for computations that outperform classical ones [8|. The 

definition of quantum discord is based on the minimization, and the analytical calculations was made only for a few 

classes of quantum states [9l4ll|. that's why it's perspective to use modern global optimization algorithms. 

. . ' It's well known that there is no universal method for finding global extremum of an arbitrary function. Till last years 

gradient methods (Newton's method, gradient descent, conjugate gradient method, etc.) were the most popular for 

optimization problems [12| . However, these methods are adapted for finding a local extremum or a global extremum 

5_j \ of very specific functions. 

Nevertheless, recently the increasing computational power allows to use more resource-intensive, but more effec- 
tive algorithms. Generally, such algorithms use biological or physical heuristics. The most popular algorithms are 
ant colony optimization [lj§], genetic algorithm [14j . particle swarm optimization [15|], quantum annealing [16| and 
simulated annealing 17]. 

We presents random mutations algorithm, which is the simplified genetic algorithm. The algorithm is being tested 
on standard global optimization problems and on the computation of the minimal measurement entropy, which is the 
entanglement monotone for pure multipartite states [2|, Il8| . After that the algorithm is used to compute quantum 
discord and tested on two-qubit states with maximally mixed subsystems. 
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2. RANDOM MUTATIONS OPTIMIZATION 
2.1. General algorithm 

Let's describe the random mutations minimization algorithm. 

Consider a general real global optimization problem. For a given rt-parameter function /(x) (fitness function) we 
need to find a vector x = {xi, x%, . . . , X n } such that 

/(x)=min/(y). (2.1) 

v 

The general random mutations algorithm consists of the following steps: 



1. Initialization. Generate n pop random real vectors Xi of dimension n, i € {l,n pop }. 

2. Mutation. From each Xi generate a set Di of rides vectors of dimension n (descendants of xi). Each vector of 
Di is independently generated from Xi by a probabilistic mutation process. We can also include Xi into Di. 

3. Selection. From each set Di we choose one "winner" vector and take it as Xi for the next generation. Also we 
store the best one of new Xi vectors (the vector with minimal /) as Xbest- 

4. Termination. If the termination conditions are satisfied, we take Xb est as the result, else go to the step 2. 

As it can be noticed, this algorithm is very similar to multipopulation genetic algorithm without migration and 
crossover. 

The most important and specific steps are mutation ^j and termination conditions (J3J). Let's describe all steps. 

2.2. Algorithm steps 

2.2.1. Initialization 

The process of the generation of initial vectors is probabilistic. We can choose different distributions according to 
a minimization problem. But almost always we can take uniform distribution on [v m i n ,v max ] for each component of 

X i . 

2.2.2. Mutation 

We use some special adaptive type of mutation. First of all, we must choose the distribution NM for the number of 
components being mutated. For example we can choose uniform distribution on [l,n maxmut ]. This is very important 
parameter of algorithm and, of course, must be chosen according to the length of the vector. 

Every new vector x,j £ Gi is generated from Xi by the following procedure: we randomly choose n mut ~ NM 
components of the vector Xj and then independently change each of them by the rule 

Vnew = Void + m ■ hP ', (2.2) 

where m (magnitude) and p (power) are uniformly distributed on [—1,1] and [pmimPmax] respectively. The p ma x 
parameter always depends on the minimization problem and is important. We usually take p m in = — 9 and the base 
b = 10. 

Such type of mutation is very similar to decimal or binary coding in genetic algorithms. It helps to adapt mutation 
amplitude during the algorithm: in the beginning even big mutations can win, but when the algorithm is near 
minimum almost only small steps can decrease the target function. But it's important that even during the final steps 
algorithm can find an appropriate large mutation (this is good for challenging local minima problem). 

2.2.3. Selection 

We use the simplest selection process: the vector with minimal fitness is taken as a winner. Another selection can 
be used, for example, winner can be chosen probabilistically according to its fitness. 



2.2-4- Termination conditions 

Every standard termination condition can be used, but the best among experiments was the next one: the algorithm 
stops if for every of n sta u last generations the xi, est was changed on the value lesser than eps. 
This termination condition is very effective against local minimum problem. 

2.3. Features and parameters 

The main advantages of random mutations algorithm are simplicity and easy parallelization. As a consequence 
the algorithm can be effectively realized and used on high-performance distributed computing systems. As it will be 
shown further, in spite of its simplicity, random mutations can outperform more complex algorithms. 

The main parameter influencing the work of the algorithm is n pop . Obviously, increase of n pop leads to higher 
convergence speed, but also increases the probability to converge to local extrema. The second important parameter 
is the number of descendants n,j es . Like population size parameter of genetic algorithms it strongly depends on the 
optimization problem and in most cases it must be tuned manually to obtain better results. The parameters n max mut 
and Pmax can be chosen accordingly to the number of arguments and the scale of a fitness function. 

Notice that n pop subpopulations of random mutations algorithm are not totaly independent, because they are linked 
by termination conditions. That's why the algorithm is not similar to the simple repeating of the single population 
optimization procedure many times. Some subpopulations can converge to "bad" local minimums, near which the 
convergence is slow, and termination conditions stops the algorithm in such cases. 

3. TESTING ON STANDARD FUNCTIONS 

Random mutations algorithm was tested on different standard test functions [13, 123] : 



n-1 

E 



F RoS enbrock{x) = £ (lOOfci+l - X 2 ) 2 + (1 - X t ) 2 ) , 



(3.1) 



min F Rosenbrock (x) = F(l, 1, . . . , 1) = 0; 

-2.048<:r;<2.048 



F R astrigin{x) = 10n + J2 ( X f ~ 10 COS(2lTXi)) , 



min F Rastrl gin (x) = F(0, 0, . . . , 0) = 0; 



n 2 n / \ 

F G riewank{x) = 1 + E 4TO0 " II ( C0S (t*) > 
i=\ i=l K v ' 



min F Griewank (x) = F(0, 0, . . . , 0) = 0; 

-512<z;<512 



Fschwefei(x) = 418.98288727 • n - £ a* sin (/R) , 
min F Schwefel (x) = ^(420.968750, ..., 420.968750) = 0. 

-512<Xi<512 ' 



(3.2) 



(3.3) 



(3.4) 



Of course, the assessment of the dependence of the accuracy from the number of algorithm iterations is not good, 
because the single iteration can be very difficult. So, we need to research the dependency of the accuracy from the 
number of fitness function evaluations. The main task of standard tests was only the verification of the correctness 
and competitiveness of the random mutations algorithm. The test functions was taken with the number of parameters 
n — 50. We used the standard MATLAB genetic algorithm. The size of the population was 5000, other parameters 
was standard. The parameters for random mutations were as follows: n pop = 40, n ma xmut — 5, for the Rosenbrock 
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Figure 1: The convergence of the random mutations and genetic algorithm on standard test problems. 

and Schwefel functions rides = 20, for the Rosenbrock and Griewank functions rides = 10. For both algorithms 
n staii = 50, e — 10~ 6 , but algorithms have different termination conditions. The MATLAB genetic algorithm stops if 
the averaged change of the best fitness over last n sta u generations is lesser than eps. The test results averaged over 20 
experiments are plotted on Fig. [1] The graphs for each algorithm are presented for the minimal number of steps over 
experiments. We take F, Griewank (400 ■ x) to preserve scale The constraints of Schwefel function (as opposed to other 
test functions) is significant. Moreover the global minimum is closer to the bound than to the center of the search 
space, so the scaling begin to influence results. We take Fschwefei (350 • x) for genetic algorithm and Fs c hwefei(100 • x) 
for random mutations. The best and average results over experiments are presented on Tab. [I] 





Rastrigin 


Rosenbrok 


Griewank 


Schwefel 


Genetic algorithm 


fbest = 0.0781 

favg = 0.1983 


fbest = 37.7922 
favg = 49.8762 


fbest = 0.3377 

favg = 1.0207 


fbest = 18.4127 
favg = 53.9818 


Random mutations 


fbest = 0.00002 

favg = 0.00006 


fbest = 32.7622 
favg = 42.4590 


fbest = 0.0194 

favg = 0.1171 


fbest = 0.0006 
favg = 0.0007 



Table I: The best and average results of genetic and random mutations algorithm on test functions over 20 experiments. 



This method of assessment is good for the test functions, which have regular structure. However, it is not so good 
for some real tasks. Consider a function with a local minima "plateau" and a global minima with a close value. The 
sample of such function is illustrated on Fig. [5] 

Global optimization algorithms are probabilistic and with large probability converge to local minimums of such 
functions. But the probability of the correct answer is nonzero. So we need to assess the dependence of this probability 
from the number of fitness function evaluations. The exact method of such assessment will be described in Sec. 14.31 




Figure 2: Function which is difficult for finding the global minimum. 



4. CALCULATION OF MINIMAL MEASUREMENT ENTROPY FOR PURE STATES 

4.1. Definition 



Let 



/ J "■iii 2 —i n |*1*2 ••• I'm) 



ii,i2,...,in— 



be a pure quantum n-qubit state (the definition for qudits is the same) 
The real positive number 



(4.1) 



H meas (\ip}) = - ^ \iii2---in\ 2 log\iii 2 ■■■i n f 

ii,i2,...,i n —0 



(4.2) 



is called a measurement entropy. 

Then the minimal measurement entropy over all orthogonal local measurement basis sets 

EHmin{\<l>)) = T1 mm H meaa {U x ® U 2 <g> . . . <8> U n \ip)) (4.3) 

is an entanglement measure (monotone) [2j. Here Ui are unitary matrices. 

4.2. Calculation 

For an arbitrary state 14. ll we need to find the global minima of H meas (Ui <g> XJi (g) . . . <8> U n \^})) over all possible sets 
of the 2x2 unitary matrices U\, U%, . . . , U n . 

The parametrization of 2 x 2 unitary matrices is well known: 



U(a,(3,6, 1 ) = e i 



e i{-P-S) C o S 7 - e K-P+S) S in 7 



(4.4) 



where /3, S, 7 are real numbers. We can easily see that a and /? doesn't affect H meas , so the final parametrization for 
our task is 



U{8,i) 



e I<5 cos7 — e ll5 sm7 
e _ '" 5 sin7 e ll5 cos7 



(4.5) 



and we need to solve the global minima problem for 2n real parameters. 

If we want to calculate Enmin for qudits (d > 2) we can use the parametrization U — e lH , where H is Hcrmitian 
and can be obviously parameterized by d 2 real numbers. 



4.3. Testing and results 

The main task of this work is testing of the random mutations algorithm, so we will present only test results, more 
calculations of E Hmin can be found in @, [l8| . 

The validity of calculations was tested on four cases: 

1. Unentengled states. Of course, for unentangled states minimal measurement entropy is equal to zero: 

E H rmn{Ul <g> U 2 ® . . . ® U n \00 . . . 0)) = 0. (4.6) 

2. GHZ states. For the generalized GHZ-states \GHZ) = A o |00 . . . 0) + A x |00 . . .0) : 

E Hmm (\GHZ)) = H sh (A , Xi) = -|A | 2 log |A | 2 - |A X | 2 log |A X | 2 . (4.7) 

3. Bipartite states. For bipartite states EHmin equals reduced von Neumann entropy. 

4. Additivity. For arbitrary n-qubits state \ip) and fc-qubits state \4>) for the (n + A:)-qubits state \ip) <3> \4>) '■ 

E H rmn{W) ® W) = E Hmm (\ip)) + E Hmin (\tp}). (4.8) 

For all tests the random mutations algorithm gives correct results (tests were made for up to 17 qubits, the 
accuracy was 10 -5 ). For these test states the algorithm shows a very good convergence. But some other states were 
very difficult, because the corresponding fitness function is of the type described in [3] (with local minima "plateau"). 
For example, for one state of 7-qubit Grover's algorithm the most possible result of the algorithm is 4.0220, but there 
is another minimum 3.968. We suppose that the second one is global, because of large amount of tests. Let's describe 
the method of quality assessment of global optimization algorithms on such problems, which was claimed in Sec. |31 

Consider n exp numerical experiments. It's obvious that the probability of the error decreases exponentially with 
repetition of optimization and we can take up the mean value of fitness calculations for achieving the success probability 
rate 0.5. 

£-0.5 — ^evaluations / T^exp ' '0^70.5 y^err j "^exp) i V ^J 

where n eva iuations is the summarized number of fitness function evaluations during all experiments, n err is the number 
of incorrect results. I.e. n eva iuations / n exp is the average number of fitness function evaluations during one experiment, 
n err /n exp is the probability of error. 

As there is no analytical values of EHmin for computationally difficult states, we take the best result after 10000 
repeats of genetic, swarm particle, and random mutations algorithm for the "correct" global minimum. 

Here is the example results for 17th step of 11-qubits Grover's algorithm. 

• genetic algorithm: eo.5 ~ 5500 

• swarm particle optimization: eo.5 ~ 11000 

• random mutations: eo.5 ~ 3500. 

Other hard states give similar results. 

Random mutations algorithm is very simple, but it demonstrates significant improvement for such problems. We 
must note that only a very simple type of swarm optimization was used, so different types and topologies probably 
may show better results. 

Described tests show us that we can use random mutations algorithm to solve some difficult optimization problems 
of quantum information theory. 

5. CALCULATION OF QUANTUM DISCORD 
5.1. Definition 

Discord is a measure of non-classical correlations in quantum systems. Consider the bipartite mixed quantum state 
p. Discord represents the difference between quantum mutual information and classical correlations: 



D{p)=I(p)-C{p). (5.1) 

The I(p) = S(pa) + S(pb) — S(p) is the quantum mutual information. Here S(p) = Trplogp is von Neumann 
entropy, pa and pb are reduced density matrices of subsystems. 

The equation for classical correlations C(p) is not so simple. First of all it's not symmetric. Let's choose subsystem B 
and make some projective measurement with the observable M, this measurement will destruct quantum correlations. 
Classical correlations after the measurement will be defined by Cm(p) = S(pa) — S(p\M), where S(p\M) = ^2piS(pi), 

i 

is a quantum conditional entropy, pi and pi are the probabilities and results of the measurement M. To calculate C(p) 
and discord itself we need to get the maximal Cm(p), so C{p) — S(pa) — min S(p\M). 

5.2. Testing 

We use random mutations optimization to calculate quantum discord. To do it we need to parameterize an 
observable M. The simplest way is to parameterize n x n hermitian matrix by n 2 real numbers: n on the diagonal 
and n(n — 1) above the diagonal. This method is redundant, because we only need the projectors of M, but not its 
eigenvalues, but this redundancy only adds some additional work for the optimization algorithm, but it obviously 
doesn't affect the result. 

The analytical results of Luo [9fl were used to test the possibility of the random mutations algorithm to compute 
quantum discord of two-qubit systems . Consider a state 

1 3 

p=-(I + Y,Cj<J 3 <E><J 3 ). (5.2) 

.7=1 

This is a state with maximally mixed subsystems. The quantum mutual information of this state is 

3 

/(p)=2 + ^A i log 2 A i) (5.3) 

»=o 

where 

A = |(1 -C1-C2- c 3 ), Ai = i(l - c\ + c 2 + c 3 ), 

(5.4) 

A2 = j(l + ci - c 2 + c 3 ), A 3 = |(1 + a + c 2 - c 3 ) 



are the eigenvalues of p 
The constraints of th 
For the classical correlations 



The constraints of the coefficients Cj are such that < Aj < 1. 



C(p) = \[{l- c) log 2 (l - c) + (1 + c) log 2 (l + c)], (5.5) 



where c = max{c\, c 2 , C3}. 

By definition, D(p) = I(p) — C(p). It's important to notice that some important quantum spin systems can be 
described by 15.21 so the value of discord for these systems can be calculated analytically [2l| . 

Let's describe tests carried out on such states. We simply generate random sets {01,02,03}, where each Cj is 
uniformly distributed on [—1,1]. If the corresponding density matrix is valid we calculate discord for it and compare 
its value with the analytical result. More than 5000 states was checked. The parameters of algorithm was as follows: 
njmut = 5, njpop — 10, n-des = 10. In all experiments the desired accuracy was achieved: difference between 
analytical and numerical results was lesser then 10~ 6 . 



6. CONCLUSIONS 

The random mutations global optimization algorithm was presented, which is a kind of simplified multipopulation 
genethic algorithm. It gives the correct results and outperforms simple genetic optimization on standard tests and 
on the computation of minimal measurement entropy, which is the entanglement monotone for multipartite pure 
states. The algorithm was also used to compute quantum discord and tested on analytically known discord values 
of two-qubit states with maximally mixed subsystems. Thus, we can conclude that random mutations algorithm is 
perspective to solve some difficult problems of quantum information theory like calculation of discord and different 
entanglement measures for multipartite states. 
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